SUPPLEMENTARY NOTE 



1. LD Score in an Unstructured Sample 

1.1. Model. We model phenotypes as generated from the equation 

(1.1) (j> = X(3 + e, 

where <f> is an N x 1 vector of (quantitative) phenotypes, X is an N x M matrix of genotypes normalized 
to mean zero and variance one (we ignore the distinction between normalizing and centering in our 
sample and in the population since the error so introduced has expectation zero and &(1/N) variance), 
f3 is an M x 1 vector of per-normalized-genotype effect sizes and e is an N x 1 vector of environmental 
(or non-genotyped genetic) effects. We describe a model where all three variables on the right side of 
equation 1.1 are random. In this model, E[e] = 0 , Var[e] = (l-h 2 g )I, E[f3] = 0 and Var[/3] = (/i^/M)/ 1 . 
To model genotypes, we assume that the genotype at variant j for individual i is independent of 
other individuals' genotypes, but we do incorporate linkage disequilibrium into the model: define 
rjk ■= E[XijXik], which does not depend on i. Finally, we assume that X, j3 and e are mutually 
independent. We will relax the assumption that environmental effects are independent of genotype 
when we model population stratification in §2.2. 

1.2. Relationship between LD and x 2 -Statistics. For each variant j = 1,...,M, we compute 
least-squares estimates of effect size f3j :— Xj<j>/N (where Xj denotes the N x 1 vector of genotypes at 

variant j) and x 2 -statistics Xj '■— Nffj. In this section, we compute E[Xj] with the expectation taken 
over random X, (3, e. 

Since E[(3j] — 0, observe that E[xj] = N ■ Var[/3,-]. We will obtain the variance of (3j via the law of 
total variance: 

(1.2) Vax\j3j] = E[Var[/3j | X]] + Var[E[/3,- 1 X]] 

= E[Var[4-|X]], 

where the second line follows from the fact that E[/3j \ X] = 0 irrespective of X. First, 

(1.3) Var^-lX] = ^Var[X/^ | X] 

= ^XjY^\X]X J 

= ^[§ X J XXT ^ +N ^- h ^)- 



This is the same assumption made in [1]. If one wishes to specify a different variance structure for the per-normalized- 
genotype effect sizes, e.g., Var[/3j] = fj, then all results presented herein hold with normalized genotypes (G;j — 2pj)/^/~fj 
replacing the usual (Gij — 2pj)/ \/2pj(l —pj), where Gij denotes additively coded (0,1,2) genotypes. 
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where fjk '■= -tt Y] XjjXjk denotes the sample correlation between variants j and k. Define the LD 



We can write the term on the left in terms of more familiar quantities as 

M 

(1-4) —X]XX^X 3 =^%, 

k=i 

1 N 

Score of variant j as 

M 

(1.5) i r . £>? fc . 

k=l 

Since 

(1.6) E[f%] « r% + (1 - r%)/N 

(where the approximation sign hides terms of order 0(l/N 2 ) and smaller; one can obtain this approx- 
imation via e.g., the (5-method), 



(1.7) E 
Thus, 



M 



fe=l 



* M - £ 1 



N(l-l/N)hl 

(i-8) nx)\ « M 9 t 1 + i 

Nhl 

M J ' 

Values of N considered in the main text generally fall between 10 4 and 10 5 , so the approximation 
1 — 1 /N ss 1 is appropriate. 

2. LD Score with Population Stratification 

2.1. Model of Population Structure. We model population structure induced by genetic drift in 
a mixture of two populations in equal proportions. We draw a matrix of normalized genotypes X 
consisting of N/2 samples from population 1 and N/2 samples from population 2 (we will use the 
notation i € P m for m <E {1,2} to denote that individual i is a member of population m), subject to 
the following constraints: Var[X i: ,] = 1, E[Xjj | i e Pi] = fj and E[JQj \i e P 2 ] = —fj- We model 
the drift term / as / <~ N(0, FstV), where V is a correlation matrix and Fst is Wright's Fst [2]. 
We postpone discussion of the off-diagonal entries of V (which might depend on LD in the ancestral 
population or recombination rates) until §2.2. Finally, if t^ m denotes the LD Score of variant j in 
population m, we assume that £j t i w £j t 2 ='■ £j- The last assumption warrants a brief explanation. 
Assuming approximately equal LD Scores in both populations is certainly not reasonable for very large 
values of Fst (e.g., if population 1 and population 2 are from different continents) or in scenarios where 
one population has passed through a more severe bottleneck than the other (e.g., if population 1 is 
from Finland and population 2 is from West Africa). However, we are interested in modeling the 
population stratification that may remain after principal components analysis 2 in GWAS that sample 
from non-admixed populations, and for this purpose the assumption that i^i w seems reasonable, 



2 Including principal components as covariates in GWAS is equivalent to regressing PC-residualized phenotypes against 
PC-residualized genotypes. PC-residualized genotypes will have more homogeneous LD structure than the raw genotypes. 
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and is supported by the large values of R 2 (i^ m ,lj^ n ) that we observe for all pairs (m, n) of 1000 
Genomes European subpopulations. 

For reference, typical values of F$t f° r human populations are w 0.1 for populations from different 
continents [2], as 0.01 for populations on the same continent [3], and < 0.01 for subpopulations within 
the same country [4]. 

2.2. LD in a Mixture of Populations. Suppose j and k are unlinked variants such that rjk i = 
r jk,2 = 0 and fj is independent of f k . In a mixture of populations, it will often hold that j and k 
will be in LD in the whole population even if they are in equilibrium in both component populations. 
Given fixed /, 

(2.1) EM/] = nXijX ik \f] 

= l - (E[XyX ifc \f,i e Pi] + E[X l3 X lk \f,ie P 2 \) 

= f j fk- 
ll we take the expectation over random fj and fj, then E[r-jfc] = 0, because fj and f k are independent 
with expectation zero. We can use equation 2.1 to compute the variance, 

(2.2) VarM - Var[E[r jfe | /]] + E[Var[r jfe | /]] 

= E[/?/2]+0 

= nffnm 

= Fgrp. 

Observe that since E[rj fc ] = 0, Var[r jfe ] = E[r| fc ]. By equation 1.5, in a finite sample, 

(2.3) E[r%]*Fl T + (l-Fl T )/N. 
Thus the sample LD Score is approximately 

M(l - F| T ) 



TV 



(2.4) E[£j) » £ 3 +MF§ T + 

» lj+MF 2 ST +M. 

Note that we have ignored the case where j and k are linked and Vjt ^ 0. In this case, E[/J/|] = 
Fj T +2i<sTV J 2 fc (from the formula for the double second moments of a multivariate normal distribution). 
Even if for some variants j, the number of variants k such that Vjk > 0 is w 10 3 , this will make a 

negligible difference in E[t,], because £ 2F ST F/ fe < 2000F ST < MF S t when M w 10 7 . 

fe:y 3fc >o 

2.3. Model of Stratified Phenotype. We model phenotypes as generated by the equation 

(2.5) ^ = Xf3 + S + e, 

where X is as described in §2.1, (3 is as described in §1.1 and where we introduce an environmental 
stratification term S, defined by 

(2.6) S t := 
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Finally, e is as described in §1.1, except Var[e] = (1 — h 2 — a 2 ), which assures that the variance of 
<p in the population is 1. Note that we have implicitly required that parameters be chosen such that 

2.4. Relationship between LD and Stratified x 2 -Statistics. We compute x 2 -statistics as defined 
in §1.1. In this section, we compute E[yJ] with the expectation taken over random X, j3, e, f but with 

S fixed to ensure population stratification. Since E[(3j] = 0, observe that E[x|] = N ■ Var [/?,]. We will 

obtain the variance of (3j via the law of total variance: 

(2.7) Var[/y = E[Var[& | X}} + Var[E[&- | X]}. 

Note that one can calculate / from X, so by conditioning on X we also implicitly condition on /. 
Unlike in equation 1.2, E[f3j | X] ^ 0, because of confounding from population stratification. The inner 
portion of the first term on the right side of equation 2.6 is the same as in equation 1.3, 



(2.8) Var^-lX] = ± \%X]XX T Xj + N(l - h 2 ) J . 

We can take the expectation over random X, p, q, using the result from §2.2 that in a sample from a 
two-way mixture of populations, 

(2.9) ±- 2 nx]XX T X 0 ] « £j + MFg T + M 
Thus, 

(2.10) ElVar^lX]] = ^ (^E[Xj XX T X 3 ] + N(l h 2 ^ 

Next, the inner portion of the second term on the right side of equation 2.6 is 

(2.11) nPi\X\ = ^E[XjXf3 + XjS + Xje] 

N 3 

= f°s- 

Since / has variance Fst, Var[/o s ] = <j 2 Fst- Thus, 

(2.12) E[ Xj 2 ] = N-Vaxfa] 

Nh 2 

-^-Ij + 1 + NF ST (a 2 + h 2 g F ST ). 

We can interpret the final term, NFst{<^s + ^I^st), as NF$t times the expected squared mean 
difference in phenotype between populations, which has environmental component <j 2 and genetic 
component h 2 Fsr (if we model X , /3 and / as random, there is zero genetic stratification on expecta- 
tion, but with some small variance about zero). Precisely, if we let <j) m denote the mean phenotype in 
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population m e {1,2}, then 



(2.13) 



a s 2 + ^ E[/3|] £e[X?- I * e Pi] + £ E[X 2 I * e p 2] 




M 



) 



Set a 



E[Oi - 02) 2 ]- Then we have 



+ 1 + aNF ST . 



(2.14) 



M 
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